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Abstract 

Searches for new resonant phenomena on top of a continuum spectra need 
to make assumptions regarding the shape of the continuum spectra. The 
Ansatz function method used in previous searches for new particles decaying 
to dijets is investigated and found to have inherent biases that cause the 95% 
confidence limits on the signal cross-section to be miss-measured by up to 30 
to 50%. 
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Previous searches for hypothetical particles produced in proton-anti-proton collisions 
which decay to two-jets |]-||] have utilized a procedure where the QCD inclusive two-jet 
mass spectrum is fitted as a linear combination of an ansatz function and a signal line 
shape to determine limits on the production cross section for various theoretical particles. 
The reliability of this procedure depends on the ability of the ansatz function to correctly 
model the mass spectrum of the continuum background. This paper describes a test of this 
procedure. 

For this test the QCD inclusive two-jet mass spectrum resulting from pp collisions at 
y/s = 1.8 TeV was simulated using the Next-to-leading order (NLO) event generator JE- 
trad || and the CTEQ4M || parton distribution function (pdf), a renormalization scale (/i) 
of 0.5E^ ax where £"™ ax is the maximum jet transverse energy (Et) of the event and a parton 
clustering algorithm. For the clustering algorithm, all partons within 1.37?- (1Z = \fff + 2 ) 
of one another were combined if they were also within 1Z = 0.7 of their Et weighted cen- 
troid where rj = — ln[tan(#/2)], & is the polar angle relative to the proton beam and <ft is 
the azimuthal angle. The Et of the jets are then smeared using typical collider jet resolutions 
||. The two-jet mass for each event was calculated using the two highest Et jets assuming 
that the jets are massless, using the relationship; M 2 = 2E^ n 7^ et2 [cosh(A??)-cos(A0)]. The 
two leading Et were required to satisfy the requirements | rj iet \ < 1.0, A77 =| rji — r] 2 |< 1.6, 
and M > 200 GeV/c 2 . The effects of statistical fluctuations in the prediction were removed 
by fitting to an ansatz function with the form: 



A ■ M~ a exp 



1 - 



M 
1800 



7 



M 
1800 



-5 



M 
1800 



V n (M) 



(1) 



where V n (M) is a polynomial of degree n. The D0 Collaboration has demonstrated || that 
this simulation is a good representation of its measured Dijet Mass Spectrum. 

To correctly model the statistical fluctuations typical of collider data samples, one hun- 
dred mass spectra were generated in 10 GeV/c 2 mass bins by smearing the spectrum obtained 
above with the Poisson fluctuations expected for the cross-section given by the simulation 
per bin times the luminosity for that bin. Four different mass regions, each with a different 
luminosity, were used to mimic data taking conditions, 0.5 pb" 1 for 200 < M < 270 GeV/c 2 ; 
5. pb" 1 for 270 < M < 370 GeV/c 2 ; 50. pb" 1 for 370 < M < 500 GeV/c 2 ; and 100. pb" 1 
for M > 500 GeV/c 2 . An example of one of the randomly generated spectra and the source 
distribution are shown in Fig. |l|. 

Three different ansatz functions will be investigated. The first is 
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where A is the normalization parameter and a and (3 are fit parameters. Two additional 
ansatz functions were investigated. The first (equation |]) was used by the CDF collabora- 
tion 0] and the second (equation |]) by the UA2 collaboration |l|||: 
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The ansatz functions were all fitted to the 100 simulated spectra using a binned maximum 
likelihood method and the minuit |10| package. The resulting average y 2 f° r the fits is 59 
for 52 data bins. The residuals ([data - fit] /fit) for each of the fits are depicted in Fig. ||. 
These residuals show that the ansatz functions do represent the mass spectra well with no 
obvious biases. 

The 100 simulated spectra were then fitted with a linear combination of the ansatz 
function and a signal line shape, 
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where Ax is the number of signal events expected from a hypothetical particle for the total 
luminosity and Si(q*) is the fraction of signal events in a given mass bin i for a given signal 



mass. For this analysis the signal line shape is given by that of an excited quark (q*) [11 
which decays to a quark and a gluon (q* — > qg). The coupling parameters of the excited 
quark theory were set equal to one (/ = /' = f s = 1.0) and the compositeness scale was 
set equal to the mass of the excited quark (A* = Mg*). The q* were simulated with masses 
from 200 to 975 GeV/ 2 at 25 GeV/c 2 intervals using the pythia |L2| event generator. 



The resulting particle jets were smeared with the assumed jet resolutions. Each q* sample 
contains fifty thousand events . Examples of q* mass spectra are shown in Fig [I| 

For each of the one hundred background spectra the signal size Nx and its error AiVx 
was determined for each of the q* masses generated. The average values of Ax (Ax) and 
ANx (AN%) were then calculated by fitting the distribution of iVx values obtained from the 
100 spectra to a Gaussian distribution (see Fig. Nx is given by the central value of the 
Gaussian and AAx is given by the width of the Gaussian. 

The 95% confidence limit (CL) on the excited quark production cross section was then 
determined by assuming that the probability density as a function of cross section is given 
by a Gaussian with a center Nx/ C and width ANx/ C The 95% CL on the cross section 
(°"95%) is then given by the value of the cross section such that 95% of the physical part 
(i.e. the cross section is greater than zero) of the probability density function is below this 
value. 

Since the spectra being fitted are derived from a NLO QCD calculation, the value of Nx 
should be consistent with zero. Any deviation is the result of an inherent bias in the ansatz 
method. Figure |] shows the fitted Ax versus dijet mass. The error on Ax is taken to be 
ANx/ v99) and it is clear that there are systematic biases in the value of Ax as a function 
of signal (q*) mass. 

If there were no bias in the ansatz fitting method the 95% CL on the production cross 
section would be given by 1.96 x ANx- A measure of the bias in the ansatz is the the 
difference between (Tq^o/ and 1.96 x AA" X : 
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The resulting biases are given in Table | and are depicted in Figure |5| (solid circles). The 
plot clearly shows that the resulting 95% CL on the production cross section can be miss- 
measured by up to ±50%. 

To see if these biases are caused by the specific choice of ansatz function, the two other 
possibilities were examined (equation |3| and f|). The bias values for these ansatz functions 
are given in Table [lj and are plotted in Fig. ^. It is clear that all three of the ansatz 
functions investigated produce a large bias in the resulting 95% CL on the cross-section of 
up to 50%. For a large range of mass values all of the ansatz functions produce biases of the 
same sign and approximate value and in some cases none of the limits will include the true 
95% CL on the cross section. As limits are being set on new particle it is important to note 
that for a significant number of the mass values investigated the ansatz function method 
underestimates the 95% CL. 

The bias in the ansatz function method is caused by the changing slope of the dijet 
mass spectrum and the presence of a parameterized signal function. This signal function 
gives the ansatz function a flex point at which it can adapt to the changing slope of the 
data. The signal line shape fills a gap between the ansatz and the data producing a low x 2 
fit. Therefore the requirement that the ansatz function fits the data with a small \ 2 is not 
sufficient to show that the method is unbiased. It is also necessary to show that the method 
does not find a nonexistent signal. 

The systematic uncertainties reported in previous searches |]-§J are 50 to 300% depend- 
ing on mass of the hypothetical particle. Hence, the limits reported by these searches will 
not be significantly degraded by this additional uncertainty. The bias in the method will 
prevent significant reduction of the systematic uncertainties in the future. The alternative 
to using an ansatz function to represent the background in these searches is a theoretical 
prediction (for example jetrad). Currently the uncertainties in these predictions are 30- 
40% [|l3j due to choice of renormalization scale and pdf. The effect of this uncertainty will 
have to be included in any future searches (this uncertainty can be reduced by improvements 
in the accuracy of pdf's and theoretical calculations). 

In conclusion it has been shown that the ansatz method of searching for new particles 
has an inherent bias in the method and this bias has to be accounted for when placing limits 
on the production cross-sections of new particle production. 

I thank my colleagues on the D0 experiment for their helpful comments, suggestions 
and discussions. This work was supported in part by the Department of Energy under grant 
DE-FG02-91ER4 0684 at Northwestern University. 
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TABLE I. The 95% Confidence Limits obtained as a function of the mass of the q* . Presented 
are the limits calculated from the fit using Nx and ANx, the limit calculated using 1.96 x AJVx 
and the bias. 
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TABLE II. The Biases for all three ansatz functions investigated. (Equation ^, Equation || and 
Equation [| ) 
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M (GeV/c 2 ) 

FIG. 1. The QCD Dijet Mass Spectrum produced by the jetrad program (solid line) and one 
of the randomly generated spectra (solid circles). Note: That the randomly generated spectra has 
been corrected so that all points have the same apparent luminosity. Also shown are the simulated 
q* at masses of 300, 500 and 700 GeV/c 2 (dash-dot lines). 
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FIG. 2. The values of the residuals ([data - fit] /fit) for each of ansatz functions fitted to the 
data with no signal function. The solid circles (a) show the results of the fits using the ansatz of 
equation pi the open squares (b) show the results for Equation ||, and the open triangles show the 
results for Equation ||. 
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FIG. 4. The values of Nx (solid circles), the uncertainty of iVx=AJVx/v 99 (solid error bars) and 
ANx (dotted error bars) obtained from fitting the simulated dijet mass spectra with Equation [2[ 
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FIG. 5. The Bias in the ansatz function method. The solid circles (a) show the bias due to 
ansatz of Equation p|, the open squares (b) show the bias due to the ansatz of Equation ^ and the 
open triangles (c) show the bias due to the ansatz of Equation |||. 
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